Robustified MANOVA with applications in detecting differentially expressed genes from oligonucleotide arrays
Abstract
MOTIVATION: Oligonucleotide arrays such as Affymetrix GeneChips use multiple probes, or a probe set, to measure the mRNA abundance of every gene of interest. Some analysis methods summarize these multiple observations into a single score before further analysis such as detecting differentially expressed genes (DEG), clustering and classification. However, this data reduction risks losing a significant amount of information and consequently reaching inaccurate or even incorrect conclusions.
RESULTS: We developed a novel statistical method called robustified multivariate analysis of variance (MANOVA), based on the traditional MANOVA model and a permutation test, to detect DEG in both one-way and two-way cases. It can be extended to detect some special patterns of gene expression through profile analysis across k (≥ 2) populations. The method uses probe-level data and requires no assumptions about the distribution of the dataset. We also propose a method of estimating the null distribution using quantile normalization, in contrast to the 'pooling' method (Section 3.1). Monte Carlo simulation and real data analysis are conducted to demonstrate the performance of the proposed method in comparison with the 'pooling' method and the usual analysis of variance (ANOVA) test based on the summarized scores. The new method successfully detects DEG under the desired false discovery rate and is more powerful than the competing method, especially when the number of groups is small.
AVAILABILITY: The robustified MANOVA package can be downloaded from http://faculty.ucr.edu/~xpcui/software
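The idea of testing one probe set with a permutation test on probe-level data, rather than on a summarized score, can be illustrated with a minimal sketch. This is not the authors' robustified MANOVA statistic or their downloadable package: the statistic used here is a simple ratio of between-group to within-group sums of squares pooled over probes, swapped in as an assumption, and the names `pseudo_f` and `permutation_pvalue` are hypothetical. The input `X` is assumed to be an arrays-by-probes matrix for a single gene.

```python
import numpy as np


def pseudo_f(X, labels):
    """Between-group over within-group sum of squares, pooled over probes.

    X      : (n_arrays, n_probes) probe-level intensities for one probe set
    labels : (n_arrays,) group label of each array
    NOTE: an illustrative stand-in statistic, not the paper's robustified one.
    """
    grand_mean = X.mean(axis=0)
    groups = np.unique(labels)
    between = 0.0
    within = 0.0
    for g in groups:
        Xg = X[labels == g]
        group_mean = Xg.mean(axis=0)
        between += len(Xg) * np.sum((group_mean - grand_mean) ** 2)
        within += np.sum((Xg - group_mean) ** 2)
    k, n = len(groups), len(labels)
    return (between / (k - 1)) / (within / (n - k))


def permutation_pvalue(X, labels, n_perm=999, seed=0):
    """Permutation p-value: shuffle the array labels and recompute the statistic."""
    rng = np.random.default_rng(seed)
    observed = pseudo_f(X, labels)
    exceed = 0
    for _ in range(n_perm):
        shuffled = rng.permutation(labels)
        if pseudo_f(X, shuffled) >= observed:
            exceed += 1
    # Add one to numerator and denominator so the p-value is never exactly zero.
    return (exceed + 1) / (n_perm + 1)
```

Because the reference distribution comes from relabeling the arrays, the sketch needs no distributional assumption on the intensities. In a genome-wide analysis the per-gene p-values would still need a multiple-testing adjustment to control the false discovery rate.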
Similar resources
Profound Transcriptomic Differences Found between Sperm Samples from Sperm Donors vs. Patients Undergoing Assisted Reproduction Techniques Tends to Disappear after Swim-up Sperm Preparation Technique
Background Although spermatozoa delivers its RNA to oocytes at fertilization, its biological role is not well characterized. Our purpose was to identify the genes differentially and exclusively expressed in sperm samples both before and after the swim-up process in control donors and infertile males with the purpose to identify their functional significance in male fertility. MaterialsAndMethod...
The Application of a Non-Radioactive DD-AFLP Method for Profiling of Aeluropus lagopoides Differentially Expressed Transcripts under Salinity or Drought Conditions
Aeluropus lagopoides is a salt and drought tolerant grass from Poaceae family, distributed widely in arid regions. There is almost no information about the genetics or genome of this close relative of wheat that stands harsh conditions of deserts. Differential Display Amplified fragment length polymorphism (DD-AFLP) led to the improvement of a non-radioactive method for which many parameters we...
Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data.
Algorithms for performing feature extraction and normalization on high-density oligonucleotide gene expression arrays, have not been fully explored, and the impact these algorithms have on the downstream analysis is not well understood. Advances in such low-level analysis methods are essential to increase the sensitivity and specificity of detecting whether genes are present and/or differential...
Identification of candidate lung cancer susceptibility genes in mouse using oligonucleotide arrays.
We applied microarray gene expression profiling to lungs from mouse strains having variable susceptibility to lung tumour development as a means to identify, within known quantitative trait loci (QTLs), candidate genes responsible for susceptibility or resistance to lung cancer. At least eight chromosomal regions of mice have been mapped and verified to be linked with lung tumour susceptibility...
Investigating the Function of Predicted Proteins from RNA-Seq Data in Holstein and Cholistani Cattle Breeds
This study was performed to determine the digital expression profile of different genes expressed in Holstein and Cholistani breeds as well as to evaluate the performance of predicted proteins derived from differentially expressed genes between these two breeds using RNA-Seq data. For this purpose, the whole mRNA sequence for a blood sample of American Holstein and Pakistani Cholistani cattle p...
Journal title:
- Bioinformatics
Volume 24, Issue 8
Pages -
Publication date: 2008